Learning Complex Rare Categories with Dual Heterogeneity
Abstract
In the era of big data, the self-similar rare categories in a large data set are often of great importance, such as malicious insiders in big organizations and IC devices with defects in semiconductor manufacturing. Furthermore, such rare categories often exhibit multiple types of heterogeneity, such as task heterogeneity, which originates from data collected in multiple domains, and view heterogeneity, which originates from multiple information sources. Existing methods for learning rare categories mainly focus on homogeneous settings, i.e., a single task and a single view. In this paper, for the first time, we study complex rare categories with both task and view heterogeneity, and propose a novel optimization framework named MLID. It introduces a boundary characterization metric to capture the sharp changes in density near the boundary of the rare categories in the feature space, and constructs a graph-based model to leverage both task and view heterogeneity. Furthermore, MLID integrates the two components so that each benefits the other. We also present an effective algorithm to solve this framework, analyze its performance from various aspects, and demonstrate its effectiveness on both synthetic and real datasets.
Similar resources
Toward a dual-learning systems model of speech category learning
More than two decades of work in vision posits the existence of dual-learning systems of category learning. The reflective system uses working memory to develop and test rules for classifying in an explicit fashion, while the reflexive system operates by implicitly associating perception with actions that lead to reinforcement. Dual-learning systems models hypothesize that in learning natural c...
The effects of dual verbal and visual tasks on featural vs. relational category learning
Many studies have examined the distinction between feature- and relation-based categories (Gentner, 2005; Gentner & Kurtz, 2005; Jung & Hummel, 2009; Tomlinson & Love, 2011). Those findings suggest that featural and relational categories have fundamentally different learning algorithms, where relational categories rely on explicit representations and thus require working memory and attention, as op...
A Cognitive Model of Recall Motivated by Inductive Learning
A dual-process model for recall is described, which is based on an architecture for the inductive learning of symbolic categories. The dual-process mechanism (generate-and-recognize) is shown to follow directly as a consequence of the inductive learning method used.
Rapid Learning of Morphological Paradigms
The present study introduces a novel paradigm for testing the ability of adults to rapidly learn novel morphological categories in the presence of irrelevant information: specifically, number markings intermixed with irrelevant gender cues. Using an artificial language learning paradigm, participants were exposed to picture-sound pairs in which pictures of animals varied by number (singular, dual and...